Phoneme Class Based Adaptation for Mismatch Acoustic Modeling of Distant Noisy Speech (Preprint)

Authors

Seckin Uluskan

John H. L. Hansen

Abstract

A new adaptation strategy for distant noisy speech is developed using phoneme class based approaches for context-independent acoustic models. Unlike previous approaches such as MLLR-MAP adaptation, which adapt the acoustic model to the features, our phoneme class based adaptation (PCBA) adapts the distant data features to an acoustic model that has been trained on close-microphone TIMIT sentences. The essence of PCBA is a transformation strategy that makes the distributions of the phoneme classes of distant noisy speech similar to those of the close-microphone acoustic model in thirteen-dimensional MFCC space (mostly in the c0-c1 plane). It creates a mean, orientation, and variance adaptation scheme for each phoneme class to compensate for the mismatch. The adapted features and the improved acoustic models produced by PCBA outperform those created by MLLR-MAP adaptation for automatic speech recognition (ASR) and keyword spotting (KWS). PCBA also offers a powerful new understanding of acoustic modeling of distant speech.
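The abstract names three per-class adaptation steps (mean, orientation, variance) but does not give the formulas. The following is only a minimal sketch of one way such a scheme could look, restricted to the c0-c1 plane. It assumes each phoneme class is summarized by its mean, the angle of its major principal axis, and the standard deviations along its two principal axes. The function names `class_stats`, `rotate`, and `adapt_class` are hypothetical and do not come from the paper.

```python
import math

def class_stats(points):
    """Return (mean, major-axis angle, (std_major, std_minor)) of 2-D points."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mx) ** 2 for p in points) / n
    syy = sum((p[1] - my) ** 2 for p in points) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / n
    # Angle of the major principal axis of the 2x2 covariance matrix.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    # Eigenvalues of the covariance: variances along the principal axes.
    half_trace = 0.5 * (sxx + syy)
    radius = math.hypot(0.5 * (sxx - syy), sxy)
    std_major = math.sqrt(half_trace + radius)
    std_minor = math.sqrt(max(half_trace - radius, 0.0))
    return (mx, my), theta, (std_major, std_minor)

def rotate(p, angle):
    """Rotate a 2-D point counter-clockwise by `angle` radians."""
    c, s = math.cos(angle), math.sin(angle)
    return (c * p[0] - s * p[1], s * p[0] + c * p[1])

def adapt_class(distant, target_stats, eps=1e-12):
    """Map one phoneme class of distant features onto the target class statistics.

    Assumed steps (an illustration, not the authors' published procedure):
    remove the source mean, rotate into the source principal axes, rescale
    each axis to the target spread, rotate to the target orientation, and
    add the target mean.
    """
    (tmx, tmy), t_theta, (t_major, t_minor) = target_stats
    (smx, smy), s_theta, (s_major, s_minor) = class_stats(distant)
    adapted = []
    for x, y in distant:
        u, v = rotate((x - smx, y - smy), -s_theta)  # mean + orientation removal
        u *= t_major / max(s_major, eps)             # variance adaptation (major axis)
        v *= t_minor / max(s_minor, eps)             # variance adaptation (minor axis)
        ax, ay = rotate((u, v), t_theta)             # target orientation
        adapted.append((ax + tmx, ay + tmy))         # target mean
    return adapted
```

In this sketch the target statistics would come from the close-microphone model's class distribution, and the function would be applied separately to the frames assigned to each phoneme class. How frames are assigned to classes, and how the remaining MFCC dimensions are handled, is not specified in the abstract and is left out here.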


Similar resources

Phoneme Class Based Adaptation for Mismatch Acoustic Modeling of Distant Noisy Speech


Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...


Feature mapping using far-field microphones for distant speech recognition

Acoustic modeling based on deep architectures has recently gained remarkable success, with substantial improvement of speech recognition accuracy in several automatic speech recognition (ASR) tasks. For distant speech recognition, the multi-channel deep neural network based approaches rely on the powerful modeling capability of deep neural network (DNN) to learn suitable representation of dista...


A study on deep neural network acoustic model adaptation for robust far-field speech recognition

Even though deep neural network acoustic models provide an increased degree of robustness in automatic speech recognition, there is still a large performance drop in the task of far-field speech recognition in reverberant and noisy environments. In this study, we explore DNN adaptation techniques to achieve improved robustness to environmental mismatch for far-field speech recognition. In contr...


Minimum cost based phoneme class detection for improved iterative speech enhancement

It is known that degrading acoustic noise influences speech quality across phoneme classes in a non-uniform manner. This results in variable quality performance for many speech enhancement algorithms in noisy environments. To address this, a hidden-Markov-model phoneme classification procedure is proposed which directs single channel speech enhancement across individual phoneme classes. The proc...




Publication date: 2012
